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Abstract. 

We investigate the translocation of a stiff polymer through a nanopore in a 
membrane, in the presence of binding particles (chaperones) that bind reversibly to 
the polymer on both sides of the membrane. A bound chaperone covers one (univalent 
binding) or many (multivalent binding) binding sites. Assuming that the diffusion of 
the chaperones is fast compared to the rate of translocation we describe the process by 
a one-dimensional master equation. We expand previous models by a detailed study 
of the effective force in the master equation, which is obtained by the appropriate 
statistical mechanical average over the chaperone states. The dependence of the force 
on the degree of valency (the number of binding sites occupied by a chaperone) is 
studied in detail. We obtain finite size corrections (to the thermodynamical expression 
for the force), which, for univalent binding, can be expressed analytically. We finally 
investigate the mean velocity for translocation as a function of chaperone binding 
strength and size. For both univalent and multivalent binding simple results are 
obtained for the case of a sufficiently long translocating polymer. 
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1. Introduction 

The problem of polymer translocation, i.e., the transport of an oligomer or polymer 
(e.g., DNA, RNA or proteins) through a nanopore in a membrane, is a process of 
fundamental importance in biology and biotechnology. Relevant biological examples 
of this type of process include: translocation of proteins through the endoplasmatic 
reticulum, translocation of RNA through the nucleus pore membrane, the viral injection 
of DNA into a host and DNA plasmid transport from cell to cell through cell walls pQ. 
Additionally, biotechnological applications connected to membrane pore passaging, such 
as rapid reading of DNA base sequences [21 El , analyte detection [I] , and nanosensor 
applications, have been suggested. In medicine, controlled drug delivery is an ultimate 
goal, a crucial element of which is the passage of cell and/or nuclei membranes. 

On the theoretical side there exist a number of investigations [51 El HI El E3 CHI HU 
O EH EH U51 UH UZ1 HE] , whose common approach to the translocation problem is to 
employ a one-dimensional description of the process using the penetration length into 
the pore as a slow variable ("reaction" coordinate); each translocation step is assumed to 
be sufficiently slow so that the polymer has time to relax to local equilibrium during the 
step (the instantaneous relaxation approximation JE]). The dynamics is then Markovian 
and can be described by a one-dimensional Smoluchowski (Fokker-Planck) or master 
equation ^T] in terms of the slow variable (however, as pointed out in [H] , this approach 
breaks down for very long polymers^). The force appearing in the Smoluchowski 
equation in general has entropic (chain confinement in the pore reduces accessible 
degrees of freedom ^2]) as well as external (electric field, chaperone binding etc.) 
contributions. Different theoretical studies have focused on different experimentally 
measurable entities: The mean translocation time is the most studied quantity [51 E]- 
More detailed studies investigated the probability density of translocation times [QUI]- 
Also the flux (number of polymers passing through the pore per unit time) has been 
theoretically investigated \1'2\ . As pointed out already there exist certain scenarios 
according to which the translocation dynamics becomes subdiffusive [HI E] • However 
in the present work we concentrate on a system whose dynamics is Markovian. 

Two important driving forces for translocation, both in vivo and in experimental 
assays, are (i) an electric field across the membrane and (ii) binding particles 
(chaperones). In this study the focus is on the latter mechanism which appears to 
be particularly common for protein translocation [TQJ HH1 EH 1221 El E3J but also of 
relevance for DNA transport through membranes [251 121] • I n the careful investigation 
by Simon et al. (see [22]) it was suggested that the translocation of proteins is a 
simple thermal ratchet process, i.e., the role of the chaperones is simply to prevent 
the backward diffusion through the pore, thereby speeding up translocation. In other 
studies the effect of the chaperones is modelled by using an effective force originating 
from the chemical potential difference due to the chaperones on the two sides of the 

| For very long polymers, the process becomes subdiffusive, and the Fokker-Planck equation may be 
replaced by its fractional analogue |2()j . 
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membrane El 13 El HZ]- More recently [TU], the coupled translocation-chaperone 
dynamics was investigated by Brownian molecular dynamics simulation, initiating more 
detailed studies of the chaperone-assisted translocation process. Although the studies 
above provide insights into the role of the chaperones in the translocation process, there 
is still no unified understanding. 

In the present work we perform a detailed theoretical investigation of the chaperone 
assisted translocation. The main contributions are (i) our result for the finite size 
correction to the force; (ii) that we include the possible occurrence of chaperones on 
both sides of the pore; and (iii) that we consider also chaperones which are larger than 
the size of a binding site. The paper is organized as follows: In section El we estimate the 
different relevant timescales of the problem, and in particular we distinguish between 
reversible and irreversible binding to the polymer. We also provide a general framework 
in terms of a master equation, which allows a theoretical description of the translocation 
dynamics. In section El we calculate the force on a stiff polymer in the reversible binding 
regime by the appropriate statistical mechanical average over the chaperone states. The 
polymer is divided into M equidistant binding sites to which the chaperones can bind, 
and we distinguish between the cases when a bound chaperone covers one (univalent 
binding) or several binding sites (multivalent binding). For the case of univalent binding 
we recover previous thermodynamical results for the force, however with a finite size 
correction. In section 0] we use the results from the previous section in order to compute 
the mean velocity of the polymer through the pore. For the case of sufficiently long stiff 
polymers simple results are obtained for both univalent and multivalent binding. In 
section El we compare the effectiveness of the chaperone assisted translocation to electric 
field driven translocation. Finally, in section El we give a summary and outlook. 

2. General framework and relevant timescales 

In this section we provide a general framework for describing sufficiently slow 
translocation dynamics in terms of a one- dimensional master equation. By estimating 
relevant time-scales we distinguish between three translocation regimes. 

The geometry considered in this study is shown in figure ^ A rod-like polymer is 
translocating through a narrow pore in a membrane (the pore is usually a few nanometer 
in size, corresponding to 10-15 nucleotides in the case of ssDNA and RNA translocation 
through the a-hemolysin channel). On each side of the membrane there are chaperones 
which can bind to the polymer. We here use the binding site size a (see figure as the 
basic unit of length; for the case of polynucleotides a may be the size of a base, and 
for unfolded proteins a may correspond to the size of an aminoacid. The translocation 
process is then described by the variable m, which is the number of binding sites on 
side B (the distance the polymer has entered into side B is x = ma) . The total number 
of polymer binding sites is M = L/o~ (L is the length of the polymer) and therefore 
for a given m the number of binding sites on side A is M — m. Denote by P(m, t) the 
probability that the polymer has passed with m binding sites into side B at time t. We 
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Figure 1. Translocation geometry in the presence of chaperones: A stiff polymer 
of length L is translocating through a pore in a membrane. The filled boxes of the 
two sides are chaperones. Each chaperone on side A (side B) occupies a volume vqa 
(vob), and binds to the polymer with a binding energy ca (es)- The volume of the 
compartment of side A (side B) is Va (Vb). The size of a binding site is a and 
when a chaperone is attached to the polymer on side A (side B) it occupies Xa (As) 
binding sites. The total number of available binding sites is M = L/a. The number 
of binding sites on side B is m = x/a, and the number of binding sites on side A is 
M - m = (L - x)/a. 



assume that P(m,t) satisfies a master equation: § 
d 

—P(m, t) = t + (m - l)P(m - 1, t) + t~(m + l)P(m + 1, t) 
at 

- (t + (m) +r(m)) P(m,t). (1) 

Equation (JIJ is our starting point for studying the chaperone assisted translocation 
process. Transitions can occur only one step in the forward or backward direction 
(i.e, m is increased or decreased by one)||. The transition probability for forward and 
backward motion is described by the transfer coefficients t + (m) and t~(m), respectively. 
In order to have a complete description we must specify t + (m) and t~(m) in terms of 

§ A general master equation has the form [221 dP(m,t)/dt = 
Em' (W(m\m')P(m' , t) — W(m'\m)P(m, t)), where W(m\m') are the transition probabilities 
per unit time. Assuming that transitions can only occur in unit steps, i.e., W{m\m!) = 
t + (m')6 m ,m'+i +t~(m')6 m ,m'-i we obtain equation (JTJ. 

|| This assumption is reasonable in view of the fact that the nanopore only allows a ID array of 
monomers (aminoacids, nucleotides) and that the passage is associated with a friction. 
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the fundamental parameters of the problem. We choose these entities such that the 
Smoluchowski (Fokker-Planck) equation is recovered in the limit a — > 0. This requires 

1 

t + (m) 

r(m) = f ( 1 - ^ ) , (2) 

with a characteristic time tq = a 2 /D and characteristic force Fq = ksT/a, where D is 
the diffusion constant for the polymer and T is the temperature of the solvent (ks is the 
Boltzmann constant). F(m) is the force acting on the polymer; in this work the focus 
is on obtaining relevant expressions for F(m) when both sides of the membrane contain 
a certain population of binding particles (chaperones), which can bind reversibly to the 
polymer, as is illustrated in figure ^ The chaperones are larger than the pore size, so 
that there is no exchange of chaperones between the two membrane sides. Typically the 
translocation is driven by the binding of chaperones on the exit side (side B). However, 
in vivo, chaperones are often present on both membrane sides (possibly the role of 
the chaperones on the entrance side is to unfold proteins before translocation \22\), 
and we therefore allow for the presence of chaperones in both compartments in figure 
^ For instance, protein import into mitochondria requires the presence of chaperones 
both on the cytosolic side (proteins belonging to the cytosolic hsp70 family) and on the 
mitochondrial side (mitochondrial hsp70) [1 . The use of a time-independent force in 
equation (J2J) relies on the assumption that the chaperone dynamics is fast compared to a 
characteristic translocation timescale (see below). For a flexible polymer (not considered 
in detail in this study) a time-independent force requires in addition that the relaxation 
time of the polymer is small compared to relevant translocation times. 

Let us now investigate how fast a polymer translocates through the pore, assuming, 
as in previous approaches, that once in the pore the polymer is not allowed to fully 
retract to the entrance side. Keeping in mind that the coordinate m is then confined 
to the interval < m < M, and that we have a reflecting barrier at m = 0, and an 
absorbing state at m = M + 1, the mean translocation time, for a process described by 
the master equation ((TJ), becomes |2Tj 

with 

•<•»>= n£g. w 

% In the continuum limit a — > 0, equations 0) and J5J satisfy the following Smoluchowski (Fokker- 
Planck) equation 27 dP{x,t)/dt = Dd/dx{-F(x)P(x,t)/k B T + dP(x,t)/dx} where x = mcr is the 
distance the polymer has entered into side B. In the equation above the Einstein relation D = ksT/^ 
(£ is the friction constant for the polymer) is implicit. 
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We have above assumed that at time t = we start with an initial condition at m = 0. + 
For later reference we introduce an average velocity 

(«> = - = ^o-M, (5) 
r r 

where we have defined a characteristic velocity Vq = D/a. It is often more illustrative to 
use (v) when discussing the results rather than the mean translocation time r. For the 
case of a constant force F(m) = F (so that the transition probabilities are independent 
of m, t + (m) = t + and t~(m) = t~) the mean translocation time can straightforwardly 
be calculated using equations (J2|) and Q, see | Appendix A The corresponding mean 
velocity, for large M (see equation (|A.2J) ). becomes 

(v)/v = t + -t~ = F/F . (6) 

Thus, for a constant force the mean velocity of a long polymer through the pore is simply 
proportional to the force acting on the polymer, i.e., the average motion is equivalent 
to classical motion, as it should. As we will see in the subsequent section the force 
due to the chaperones is such that it becomes constant for m > m , where m is some 
characteristic finite size correction length. For a sufficiently long polymer the finite size 
correction is negligible and the expression above can be used to obtain the mean velocity. 
However, very frequently the translocating chains are relatively short (for instance, small 
proteins are ~ 60 aminoacids long, see reference pQ pp. 117-118; or in vitro studies of 
single-stranded DNA translocation concern chain lengths well below 100 bases [Sj), and 
the finite size corrections do come into play. 

There are three relevant time scales associated with the problem: The time t^s 
needed for the polymer to diffuse a distance of the order of the binding site length cr; the 
typical time r unocc a binding site stays unoccupied; and the characteristic time r occ that a 
binding site remains occupied. Let us estimate these different time scales, assuming that 
there are no chaperones on the entrance side (side A) for simplicity:* The time needed 
for the translocating polymer to diffuse a distance a is simply r diff = r /2 = a 2 / 2D. 
Taking cr ~1 nm and D ~ 0.1 nm 2 /s(j we find r^m — 5 s. We now consider r unocc 
and r occ . Denote by D c the bulk diffusion constants for the chaperones and by c the 
bulk concentration of chaperones. The chaperones bind to the polymer site with a 
binding energy e (<0). Clearly the probability that a binding site is occupied depends 
on both the concentration and the binding energy, and we will see in the next section 
(see also |Appendix B ) that we can form a dimensionless number k = coK eq which is a 

+ In the continuum limit equation (0 becomes |2Zj r ~ L dx" exp(/3G(x")) J Q dx' exp(— (3G(x'))/D 
where L is the (contour) length of the polymer, G[x) — — F{x')dx' and x — ma. 
* We do not consider the characteristic time associated with one dimensional diffusion of binding 
proteins along the polymer. For instance, for non-specific binding of DNA transcription factors this 
introduces yet another time scale |28| . 

jj We here use the estimated effective diffusion constant D ~ 0.1 nm 2 /s for protein import into 
mitochondria in reference |29j. obtained by comparison to experimental translocation times. Notice 
that this value for the diffusion constant is orders of magnitude smaller than that of a freely diffusing 
polymer |29| . probably due to polymer interactions with the pore. 
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relevant measure of the binding strength, where we have defined an equilibrium binding 
constant K eq = v exp(/3|e|) (t> is the typical size of the chaperones, (3 = 1/(/cbT), 
with ks being the Boltzmann constant and T the temperature of the solvent as before). 
Considering univalent binding for simplicity, the equilibrium probability that a binding 
site is occupied is (see | Appendix Q P^ c = k/(1 + k), and the probability that a binding 
site is unoccupied is therefore P,^ occ = 1 — P^c — 1/(1 + ft). At equilibrium the ratio of 
these probabilities is equal to the ratio between the times r occ and r unocc , i.e. we have 
Tocc/Tunocc = ft. We now proceed by obtaining r unocc , which then through the above 
relation also determines r occ : Consider a binding site, which initially is vacant. If we 
assume that k is not too small, then as soon as one chaperone is at the binding site it 
becomes trapped and the binding site occupied. The distance between chaperones in 

— 1/3 

solution is R ~ Cq . It suffices for a chaperone to diffuse a distance of the order R 
for any one chaperone to attach to the binding site (provided k is sufficiently large), 
this takes a time r unocc ~ R 2 /D c ~ l/(c 2//3 -D c ), which then determines the characteristic 
time an initially vacant binding site stays unoccupied (see reference [30J for a more 
thorough investigation of this problem). ff Taking D c ~ 10 6 nm 2 /s and Cq ~10 fiM, 
we find r unocc ~ 1 ms. Thus typically the binding time is faster than the time for the 
polymer to diffuse a distance of the order one binding site. We note, however, that the 
above estimated numerical values are very crude, and in particular that the polymer 
diffusion constant D may deviate substantially from the result given here (D depends 
on the nature of the polymer-pore interaction). We found above that r occ = nr miocc , and 
thus if the binding strength k is large r occ can become large, even if r unocc is small. The 
considerations above allow us to distinguish between three different dynamical regimes: 

(i) Diffusive regime, t^e <C T unocc ,r occ . In this regime the diffusion through the pore is 
so fast that the chaperones do not have time to bind to the translocating polymer. The 
force in equation (j2J) is then essentially zero, and the mean translocation time equation 
(jnj) becomes simply r = tqM 2 /2. This regime can always be reached by lowering 
the concentration of chaperones sufficiently The diffusive regime corresponds to cases 
previously discussed (see for instance [HE]) and will therefore not be considered further 
in this investigation. 

(ii) Irreversible binding regime, r unocc <C r^m "C t occ . This regime corresponds to a 
situation when the particles have sufficient time to bind, however do not unbind from the 
polymer during the translocation. In this so called Brownian ratchet regime fl^\ \W\ |3] 
the particles, at sufficiently high concentrations and binding energies, bind immediately 
as soon as the polymer has diffused a distance equal to the size of a chaperone (or for 
univalent binding, a distance equal to the distance between binding sites). The binding 
of chaperones prohibits backward diffusion through the pore, and the polymer can only 
"jump" in the forward direction. This regime can formally be obtained by letting 
F = 2Fq in equation (J2J); then the backward transition probability is zero, t~ = 0, the 

ft Clearly, the depletion of chaperones due to binding to a neighbouring binding site could affect the 
time for binding. The concentrations of chaperones is expected to be sufficiently high so that this effect 
will not be dominant for estimating the relevant binding time. 
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forward transition probability per unit time is t + = 2/r , and hence the master equation 
(HJ) becomes 



This equation gives a coarse-grained description of the ratchet process, where only 
forward jumps are effectively allowed. The same type of equation appears also in the 
theoretical description of shot noise [27] . and in the continuum limit corresponds to the 
forward mode of the wave equation [HI]. The mean translocation time for ratchet motion 
becomes r = tqM/2 and hence the mean velocity (see equation (jSJ)) is (v) = 2vq, which 
agrees with the result of other studies ^Ql 122] ■ The ratchet mechanism gives a decrease 
of the translocation time by a factor a/L = 1/M compared to the translocation time in 
the diffusive regime described in (i) above. 

(iii) Reversible binding regime, T unocc ,r occ <C r^g. In this regime the particles have time 
to bind and unbind many times during the time it takes for the polymer to diffuse a 
distance o. The polymer thus has time to reach local equilibrium, and as we will see 
we can then obtain the force F{m) by the appropriate statistical mechanics average of 
the chaperone states. We will in the rest of this paper investigate this case of reversible 
(both univalent and multivalent) binding more closely. 

3. Force F(m) in the reversible binding regime 

In this section we investigate the force F(m) for reversible binding of chaperones to the 
translocating polymer. The chaperones are assumed to cover one binding site (univalent 
binding) or many binding sites (multivalent binding), when attached to the polymer. 

3.1. General expression for the force F(m) 

We start by deriving a general expression for the force F(m) on the translocating 
polymer, arising from the interaction with chaperones on the two sides of the membrane, 
in the reversible binding regime. 

Let us obtain a statistical mechanical expression for the force on the translocating 
polymer. Denote by Z(m, n^,ne) the Boltzmann-weighted number of configurations 
for a state specified by m, ua and tib, where ua {^b) is the number of attached 
chaperones on side A (side B). For two unconnected compartments (see figure 
this statistical weight can be written as the product of the statistical weight on side 
A and B respectively, i.e., Z(m, n^n^) = ZA(rn,nA)ZB(rn,n B ). This is a natural 
decomposition, as the binding proteins cannot cross the nanopore (at least not in the 
presence of the translocating polymer). The force then decomposes in the form 



i.e., the total force has independent contributions from side A and side B respectively. 
Let us proceed by writing down the appropriate statistical mechanical expression for 




(7) 




(8) 
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the force originating from side 7 (7= A or B). We have (see reference |l(Jj ) 




max 



(9) 



where p 7 q (m, n 7 ) is the equilibrium probability density for a state specified by m and n 7 . 
We have above used the fact that the explicit expression for the equilibrium distribution 
is p 7 q (m, n 7 ) = Z 7 (m, n 7 ) / Z n {m) , where the partition function Z 7 (m) for side 7 is 
obtained by summing the statistical weight over all allowed values of n 7 : 



where n™ ax is the maximum number of attached binding particles on side 7. Notice that 
this quantity depends on m. As before, we take F Q = ksT jo. The force, equation (JHJ), 
is given by weighting the derivatives, —d In Z 7 (m, n 7 ) / 9m, of the free energy of state m, 
n 7 by the equilibrium probability density (compare to equation (jC.l|l ). We note here 
that in order for the derivative in the force expression to be well-defined, the equation 
for the free energy must be analytically continuable to non-integer m (the expression 
for the free energy considered here are expressible in terms of factorials, which can be 
analytically continued through T-functions, see next subsection). We point out that the 
force F 7 (m) may in general incorporate other effects, for instance, as caused by electric 
fields or protein folding (see discussions in sections El and If the polymer is flexible 
chain entropic effects give an additional contribution to the force. Provided that the 
chaperone binding is independent of the curvature of the polymer, the binding force (as 
calculated in the next section) and the entropic force are additive, and the entropic force 
expression given in reference jH] can be used. For the sake of clarity, we here neglect 
entropic effects, i.e., we assume a rod- like polymer. This is not a strong restriction of 
the model, as in the presence of a drift as exerted by the chaperones (see below), the 
entropic effect is expected to be negligible for most chain lengths relevant to proteins, 
compare reference 0. 

Let us finally rewrite the force in a form convenient for obtaining F(m) in the 
thermodynamic limit (m — > 00 for side B and M — m — > 00 for side A). We write 



'7 




(10) 



n~,=0 




(11) 



with 




(12) 



and 
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where the partition function Z 7 (m) is given in equation (JTUJ). The force (jllj) is composed 
of two terms. The first term, explicitly given by equation (jl2j) . is the thermodynamic 
expression for the force obtained by taking the derivative of the logarithm of the partition 
function with respect to m. As we will see in subsections 13.31 and 13.41 this term is in 
general independent of m, and is proportional to the chemical potential difference across 
the membrane. The second term, given in equation (fT3*j) . is a correction term to this 
thermodynamic result. We note that the presence of this correction term is due to the 
fact that the upper limit n™ ax in the sum in (j!3)) depends on m (if this were not the case 
we could move the derivative in front of the sum and A 7 (m) would be identically zero). 
In subsection 13.31 we show that A 7 (m) vanishes for large m for the case of univalent 
binding. We will therefore henceforth call A 7 (m) a finite size correction term. We note 
here that in order to calculate the general force expression equation (JH)), we must evaluate 
(complicated) sums involving the statistical weights Z 7 (m, n 7 ). However, for obtaining 
the force equation ()12|) in the thermodynamic limit (i.e., large protrusion distances), 
a knowledge of the partition function Z 1 {m) suffices (as we will see in subsection 13 A\ 
Z^(m) can be straightforwardly calculated using a transfer matrix approach). 



3.2. Explicit expression for the force F{m) 

In this subsection we study in more detail the forces F^rn) and Fe(m) when the two 
compartments contain chaperones, which bind reversibly to the translocating polymer. 
In particular, we obtain the forces as a function of chaperone effective binding strengths 
and sizes. 

In order to obtain the force on the polymer we must have explicit expression for the 
statistical weights Z 7 (m, n 7 ) (see equation 0; 7= A or B). The details of the calculation 
of Z 7 (m, n 7 ) are given in Appendix B There are two entropic effects that must be taken 



into account: (i) as m increases the number of available binding sites increases on side 
B and vice versa; and (ii) as the number n 7 of bound chaperones increases the entropy 
of the surrounding "gas" decreases. We neglect the reduction of volume due to the 
presence of the translocating polymer. We assume that the chaperones are equal in size 
(univalent binding) or larger than (multivalent binding) the size of a binding site, and 
cover an integer A 7 (> 1) number of binding sites if bound to the polymer on side 7 (for 
instance, bacterial transcription factors cover ~ 10 — 20 basepairs |2E])- The statistical 
weights then become (see equation (|B.5jl ) 



Z 7 (m,n 7 ) = ^ ind (m,n 7 )^, (14) 

where f2^ ind (m, n 7 ) denotes the number of ways of arranging n 7 particles onto the m 
binding sites on side B or onto the M — m binding sites on side A. For the case of dilute 
solutions the effective binding strength k 7 appearing above can be written: 

k 7 = c 7 x; q (15) 

where c 7 is the concentration of chaperones on side 7, and K^ q is the equilibrium binding 
constant, see equation (jB.8|) . Since the effective binding strength k 7 is proportional to 
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the chaperone concentration, one can experimentally vary k 1 by changing the latter. 
We note that for A 7 > 2 there exist correlations between binding sites in the sense if one 
binding site is occupied, then at least one of the neighbouring binding sites is occupied. 
In turn, a binding protein needing more than one binding site to actually bind cannot 
bind between already bound chaperones if their distance is less than A 7 . Thus, the 
binding characteristics for large and small (A 7 = 1) are different. Explicitly we have for 

side B mm 

fif( m ,n B ) =( m ~ {Xb ~ 1)HB ) = (m ~ (Ag ~ lK , )! , (16) 
\ n B ) n B \{m-\Bn B )\ 

and similarly for side A with the replacement m — > M — m, n B —>■ n& and X B — > A^. 
We note here that we have above neglected cooperative effects (i.e., the effect that the 
chaperones attached to the polymer may interact). Such effects are usually incorporated 
into the theory through a cooperativity parameter uj 1 (221 E3 EH], where no cooperativity 
corresponds to u 1 = 1. For u; 7 > 1 (positive cooperativity) the chaperones bound 
to the polymer interact attractively, whereas for co> 7 < 1 (negative cooperativity) the 
chaperones repel each other. The (somewhat lengthy) relevant expressions for Z 7 (m, n 7 ) 
with c<j 7 ^ 1 can be found in reference [32J. In subsection 13.41 we revisit the problem and 
show that the partition function Z 7 (m) for large m may be straightforwardly obtained 
including cooperativity, allowing for a determination of the force F(m) (see equation 
([12))) in the thermodynamic limit. 

Equations (jHJ), (fTHjl and (fTHjl for the statistical weight Z 7 (m, ra 7 ) completely 
determine the effective force F y (m) (see equation 0). Combining equations © and 
([HI we straightforwardly obtain the force on the polymer from side A: 

„max 
TP ( \ A 

= - E Pa(™> n A ){<f(M -m-(\ A - l)n A + 1) - *(M - m - A A n A + 1)}.(17) 
Similarly the force from side S is: 

^r^= E PB q KM{^(^-(A B -lK + l)-^(m-A B n B + l)}, (18) 

^0 n s =0 

where ^(2) = dlnT(z)/dz = T'(z)/T(z) is the \l/-function [3^, and we have analytically 
continued the factorials appearing in equation (fTt^) using T-functions. The maximum 
number of particles that can attach to the polymer on side B is n^ ax = [m/X B ], i.e., it 
is the largest integer smaller than or equal to m/X B . Similarly for side A the maximum 
number of attached chaperones is n™ ax = [(M — m) / Xa] ■ We notice that the force from 
side B is zero, as it should, when the chain does not protrude at that side, i.e., we 
have F B (m = 0) = 0. Also, since ^(z) is an increasing function with z, the force from 
side B is positive F B (m) > 0, whereas the force from side A is negative Fa{itl) < 0. 
Equations (fTTj) and (|18|l are general expressions for the force, and are convenient for 
numerical computations. We point out that in general the force F(m) depends on 
five dimensionless variables: the binding strengths k,a an d k b , the relative sizes of the 
chaperones A^ and X B , as well as the effective length M = L/a of the polymer. In the 
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next two subsections we derive simplified approximate results for the cases: (i) univalent 
binding (A 7 = 1), and (ii) general multivalent binding and long polymers. In the latter 
subsection we also revisit the problem of cooperative effects (a; 7 7^ 1). In these two 
subsections we compare the results to the exact expressions for the force, equations (fTTj) 
and (fTHjl . 



3.3. Univalent binding 

In this subsection we consider the case of univalent binding, A 7 = 1, and obtain the 
force in the thermodynamic limit. We also derive a an approximate expression for the 
finite size correction to the force. 

For univalent binding A 7 = 1 we have = m and n™ ax = M — m and therefore 
£7 7 ind (m,n 7 ) = n™ ax !/(n 7 !(n 7 iax — n 7 )!) (see equation (JTHI) ) • The partition functions 
Za{iti) and Z B {m) can straightforwardly be calculated using equation (fTlj) . We find 



.max 

S 

- max 



Z 7 (m) = ]T Z 7 (m,n 7 ) = (1 + K 7 ) n " mx , (19) 



where we have used the binomial theorem [21]. This equation is a standard result for 
the partition function for univalent, non-cooperative binding to a polymer |37j. Notice 
that the dependence on the chaperone concentration and the binding energies appear 
only through the quantity equation ()15|) ). Let us now calculate the force F 7 (m) 

in the thermodynamic limit. Using equations (fT2|) and (fiT?|) we find: 

^ = ^ = ±ln(l + S ), (20) 

where the plus sign corresponds to side B, and the minus sign corresponds to side A. We 
notice that the thermodynamic force is independent of m, and the expression above can 
be viewed as a chemical potential difference across the membrane, compare references 

unman)]. 

We proceed by considering the finite size correction to the force, equation |T3|) . 
Replacing the sum over n 7 in equation (jlHj) by an integration and using Leibniz's theorem 
for differentiation of an integral [31] as well as the fact that Z(m, n™ ax ) = k 7 7 we find 
(for side B) 

-^"U^J Z B (m) = ItT^J =eM-m/m 0B ), (21) 

where tuqe = l/ln([l + kb]/kb) = l/h^/i? 1 ) m terms of the filling fraction fs of the 
polymer on side B as contained in equation (jC.2|) . The finite size correction for side A, 
A J 4(m), is obtained in an identical fashion. The finite size correction is exponentially 
decreasing with increasing m, and vanishes over distances larger than m 07 , where mo 7 is 
determined by the filling fractions on the two sides; for large (small) filling fraction, i.e., 
large (small) k 7 , the correction decays slowly (rapidly) with m. Figure E] shows the above 
expression for the force (equations (JTTJ), ([20]) and together with the exact result 

(equation (fT£|) ). Notice that the result above, i.e. the expression the force as given by 
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equations (|20|) and (|21|). captures the decrease of the force with decreasing m, but 
does not fully agree with the result from the general expression for the force (j!8|) . due to 
the continuum approximation leading to equation (j2H). We point out that a decrease of 
-Fe(m) for decreasing m agrees with the molecular dynamics simulations in reference [TP] , 
The results obtained in this subsection show that for a long (m > tuob) polymer the force 
can be calculated using the thermodynamic expression F y (m)/F = <91n Z^(m)/dm, 
but for short polymers finite size corrections become relevant, and one has to resort to 
the exact expressions for the force, equations (fTTjl and ijTBjl . In figure this fact is 
demonstrated for the case A# = 12, a typical value for binding proteins. 



3.4- General case, large m 

In this subsection we show that for large m the force F 7 (jn) can be obtained through 
the solution of an algebraic equation for general A 7 . The approach allows us to revisit 
the problem of cooperativity (o> 7 7^ 1) in a straightforward manner. 

In order to obtain the force for large m, a knowledge of the partition function Z 7 (m) 
suffices (see equation ()12j) ). Rather than using the combinatorial approach given in the 
previous subsections the partition function can more conveniently be obtained using 
the approach pursued in reference (see also [39 ): In general the partition function 
Z 7 (m) can be written as [33] 

A 7 + l 

Z 7 (m) = £ aj A?, (22) 
i=i 

where Aj are the A + 1 roots to the algebraic equation (j—A or B) 

a a 7 +i _ a a 7 _ + _ ^ = 0. (23) 

The prefactors ay are independent of m and are explicitly given by aj = Aj — 
XjdAj/dhiKj. Equation (j2"3*)) is the secular equation associated with the transfer 
matrix of the system [HE]- Equations (j22|) and f!23[) completely determine the partition 
function of the system. Notice that the above approach incorporates cooperativity 
effects (through the cooperativity parameter cj 7 ) without substantially raising the level 
of complexity. However, we point out that in the present approach only the partition 
function Z 7 (m), and not the statistical weights Z 7 (m, n 7 ), can be calculated. Therefore 
the approach discussed in this subsection does not allow computation of the exact 
expression for the force (see the definition of the force, equation (J§J), and also equations 
(HZj) and (HE}). 

Let us now calculate the force for large m. Denote by A™ ax the largest root to the 
algebraic equation equation (|23|). Then for large m the force equation (fT2"|) becomes 

M^ = |l«±lnA7* > ( 24 ) 

where the plus sign corresponds to 7 = B, and the minus sign to 7 = A. The force 
in the thermodynamic limit is hence proportional to the logarithm of the largest root 
A™ ax . Equation (|23|) can straightforwardly be solved on a computer, and hence the 
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Figure 2. The force F(m)/Fo in units of Fo = ksT/a (T is the temperature of the 
solvent, ks is the Boltzmann constant and a is the binding site size, see figure^). The 
dashed lines correspond to approximate results: For the cases of univalent binding 
\b = 1 the upper dashed line is the analytic result as contained in equation J3UJ. 
For the case Xb = 2 we have included a plot (middle dashed line) of the approximate 
result contained in equation l|25[) . For Xb = 12, a typical value for many DNA-binding 
proteins, the lower dashed line corresponds to the result obtained through equations 
(I23|l and l|24|) . The remaining curves corresponds to the results obtained through the 
exact expression equation (|18|l . for: Xb = 1 (upper curve), Xb = 2 (middle curve) 
and Xb = 12 (lower curve). The binding strength (see equation dJ) was taken to be 
kb = 1- No binding particles were assumed to be present on side A. Notice that for 
small m (and > 2) the force "oscillates" with a period Xb- For large m the force 
approaches a constant value. The onset of the force is where m = Xb, i.e., the force is 
zero unless there are sufficiently many binding sites on side B to accommodate at least 
one chaperone. The solid lines are only meant to guide the eye. For typical chaperone 
sizes and translocating polymer lengths the finite size corrections may become quite 
relevant. 
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determination of the force for large m is a simple matter. For univalent, non-cooperative 
(u 1 = 1) binding equation (}2*3*|) becomes a first order algebraic equation which can be 
easily solved. The force as obtained from this solution together with equation (|24jl agrees 
with equation (j2*Uj) as it should. In the previous subsection we showed that the force in 
the thermodynamic limit is independent of m for the case of univalent, non-cooperative 
binding. The result above proves that the force, for large m, is independent on m for 
general values of A 7 and u> T We have in figure El plotted the force for multivalent binding 
using the exact result, equation (fTSJ) . as well as the above result Fe(m). The agreement 
is good for large m. Notice that the exact result has an "oscillatory" behaviour with a 
period Xb for small m-values. We interpret the these oscillations in the following way: 
if m is equal to an integer multiple of the polymer can, potentially (for large binding 
strengths), fill the polymer and hence completely restrict backward motion (perfect 
'ratcheting'). However, when m is not an integer multiple of Xb there must be vacant 
spaces in between bound chaperones (for instance for m = 5 the maximum number of 
bound chaperones is 2 for divalent binding (Xb = 2), and hence there must be at least 
one vacant binding site, even for large binding strengths), and the 'ratchet' effect is 
less pronounced. In reference j!32 similar types of oscillations were found in the filling 
fraction of a polymer as a function of m, for the case of multivalent binding. 

For divalent (A 7 = 2), non-cooperative binding equation (|2*3j) becomes a second 
order algebraic equation, which can straightforwardly be analytically solved, yielding 
A = [1 ± (1 + 4k 7 ) 1 / 2 ]/2. The corresponding force (for large m) becomes 



where the plus sign corresponds to 7 = B, and the minus sign to 7 = A. We point out 
that the force for divalent binding as given by equation (J25|) has a different functional 
dependence on k 7 compared to the case of univalent binding, see equation (|2*Uj) . 

Let us finally obtain the force for large m, univalent binding (A 7 = 1) and 
including cooperativity effects (arbitrary cj 7 ) using the approach above. For this 
case equation (|23j) becomes a second order algebraic equation with the roots: A = 
(1 + co> 7 /t 7 )/2 ± {(1 +u; 7 k) 2 /4 + (cu 7 — l)/^} 1 / 2 . Hence the force, equation (|2*4*)) . becomes 



are real. We note that W(kj) has a maximum value 1/2; therefore equation (|2fi|) applies 
whenever the cooperativity parameter satisfies co> 7 > 1/2. In addition, when k 7 < 1/4 
we find that W(k^) is negative. Hence equation (j2E| is valid for any value of the 
cooperativity parameter u; 7 provided that k 1 < 1/4. For no cooperativity cj 7 = 1 the 
above result reduces to previous results (see equation (}20|) ). Notice that the force for the 
case of positive cooperativity is larger than the force for negative cooperative binding, 
as it should. 




(25) 




(26) 



where we have assumed that uj^ > W(kj) = 3 [2 (2 + k 7 ) 



1 / 2 /3 — l]/« 7 so that the roots 
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4. Mean velocity 

In this section we study the mean velocity for the polymer translocation. In particular 
we find a simple form for the mean velocity for long polymers. 

The mean velocity is obtained on the basis of the force obtained in the previous 
subsections together with equations (jHJ) and (jHJ). For the general case (finite sized 
polymers) the force expressions as contained in equations (fTTj) and (fTHj) must be used. 
For sufficiently long polymers we can ignore the finite-size effect and use equation (JHJ) 
together with the thermodynamic expressions for the force derived in the previous 
subsections. We point out that (v) then (for non-cooperative binding) depends on 
four dimensionless variables: ka = caK a ^, k b = cbK^, \a, \b (c 7 is the concentration 
of chaperones on side 7 and is the equilibrium binding constant for the chaperones 
on side 7, r y=A or B). We can therefore for sufficiently long polymers write the mean 
velocity according to 



with vq = D/a and Fq = ksT/a. The relevant expressions for the forces Fa and Fb were 
given in the previous section: (i) For the general case the forces follow equation (f2"^j) 
and the problem is that of determining the largest root A max to the algebraic equation 
(j23|) . (ii) For univalent binding (A 7 = 1) we use the force expressions according to 
equation (j2HI)- For the case of no cooperativity, co> 7 = 1, this equation reduces to the 
simple result given in equation (|2"U|). (iii) For divalent binding (A 7 = 2) and non- 
cooperative interactions (u 1 = 1) the forces are given by equation (f23j) . When the 
chaperone baths on the two sides contain chaperones of identical binding strengths 
k>a = Kb and sizes Xa = A# the mean velocity is zero, as it should. In general, however, 
the size of the chaperones on side A and side B may differ A^ 7^ A#, which may lead 
to interesting behaviour of the mean velocity as a function of ka and Kb (i.e., of the 
chaperone concentration on the two sides). In particular we notice that the dependence 
on k 1 differs between the cases of univalent and divalent binding (see equations (f2*U|) 
and ()25j) ) for non-cooperative binding. The binding strength k 7 is proportional to the 
concentration of chaperones on the two sides. Thus by measuring the mean velocity as 
a function of concentration of chaperones, it should be possible to reveal the nature of 
the binding on the two sides (i.e., the values of A 7 and uj 7 ). In figure El we have plotted 
(v) as a function of Kb for different A#, assuming no chaperones to be present on side A 
for simplicity. The solid lines in figure El correspond to the mean velocity as calculated 
using equations © , © and the exact expression for the force equation ([!%)) . The dotted 
lines corresponds to the approximate results obtained from equations (f2T|) . (f2"U|) . (f23|) 
and ()24|) . We notice that the deviation between the above approximate results and the 
exact result for the mean velocity is larger for larger values of k t This originates from 
the fact that for large k 1 the finite size correction to the force is more pronounced (see 
equation (j2U)- For finite sized polymers the finite size correction to the force found 
here thus a non-negligible plays a role in the translocation dynamics. However as we 




(27) 
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increase the length of the polymer the approximate results (dashed lines) as given above, 
coincide with the exact result (solid lines). 

A few words on the experimental relevance of the finite size effects are in order. 
From figure 121 we notice that finite size effects are prominent for the force for m- values 
up to m < (3 — 4) A# for the Kb- value chosen in the figure; thus typically the larger the 
chaperones (larger A#) the more pronounced is the finite size effect for a given polymer 
length. We note here that it might be possible to measure the finite size effect of the force 
directly; for instance by attaching a bead at one end of the polymer and trapping the 
bead in an optical tweezer, one might directly probe the force on the polymer due to the 
presence of chaperones, compare to the experimental setup in reference j2S|- The force 
is, however, not the usual experimental observable. Instead, what is usually obtained in 
experiments is the mean velocity (or mean translocation time). As seen in figure El the 
finite size effects are effective also for "long" polymers (M > (3 — 4) As). This is due to 
the fact that the mean velocity, also for long polymers, contains information about the 
dynamics in the small m-regime. 



5. Comparison to electric field induced translocation 

In this section we compare the binding assisted translocation to electric field induced 
translocation. As we have seen in the previous sections the characteristic velocity due 
to binding of chaperones along a polymer is vq = D/a, where D is the polymer diffusion 
constant and a is the distance between binding sites. We now compare v to the velocity 
due to the the presence of an electrostatic voltage AV across the membrane. If we denote 
the linear charge density of the polymer (charge per unit length) by p, the electric force 
on the translocating polymer is _F e i cc = pAV, and hence the velocity is 

F elec DpAV 

Velec = —T- = , 28 

f kBT 

where £ = ksT/D is the friction constant for the polymer. Setting t> = t> e i ec we find 
that we need a voltage across the membrane 

7 rp 

AV = AV t = — (29) 
pa 

in order to get a velocity from the electric field which equals the velocity due to the 
chaperones. Let us estimate the voltage for a (highly) negatively charged polymer like 
DNA. We then take p ~ 5 unit charges/nm, ksT ~ 26 meV (room temperature) and 
a ~ 1 nm, which gives AV t — 5 mV. The typical (resting) potential across the eukaryotic 
cell membranes is ~ 70 mV. Thus for charged polymers like DNA it is "preferable" to 
use electric fields for efficient transport. In contrast, the linear charge density of a 
protein is sensitive to the aminoacid sequence and the pH of the solution; at high (low) 
pH a protein is typically negatively (positively) charged. Therefore, non-specific protein 
transport cannot in general rely on electric field induced translocation; this may explain 
in parts why nature has invented the chaperone assisting machinery. 
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Figure 3. The mean velocity (v) for translocation of a finite sized polymer as a 
function of binding strength kb = CBK B q , for different relative chaperone sizes Xb 
(cb is the concentration of chaperones, and K B q is the equilibrium binding constant). 
The solid lines corresponds to the mean velocities as calculated using equations 

and the exact expression for the force as contained in equation Ijl8(l . The dashed 
curve in connection with the Xb = 1 line, is the approximate result given in equations 
(|27[) together with equation (|2U|) . The dashed curve with the Xb = 2 line is the 
approximate expression given in equation (|27|) and H25|) . For = 12 the dashed 
line corresponds to the result obtained through equations l|23|) and (|24|l . No binding 
particles were present on side A. The effective length of the polymer was taken to be 
M = 60. The mean velocity increases monotonically with increasing binding strength 
kb- Notice that the deviations between the exact (solid curves) and the approximate 
results (dashed curves) are more pronounced for large values of kb ■ 

6. Summary and outlook 

We have in this work investigated the translocation of a stiff polymer through a nanopore 
in a membrane, in the presence of binding particles (chaperones) that bind to the 
polymer on both sides of the membrane. Assuming that the diffusion of chaperones is 
fast compared to the rate of translocation we described the process by a one-dimensional 
master equation. We closely investigated the translocation dynamics for the case of 
reversible binding to the polymer and found that the dynamics depend on whether the 
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chaperones bind univalently or multivalently to the polymer. For the case of univalent 
binding we derived an analytic finite size correction to the force exerted on the polymer 
by the chaperones. In general, the finite size corrections we quantified in this study 
may be used to extract information on the nature of the chaperones from experimental 
data. For long polymers a simple expression for the mean velocity of the polymer 
through the pore was found. We also discussed the problem of irreversible binding 
to the translocating polymer, as well as compared the effectiveness of binding assisted 
translocation to electric field driven translocation. 

We want to point out that the case of perfect thermal ratchet translocation [22] 
(immediate irreversible binding) cannot be obtained by simply taking the effective 
binding strengths to be infinite in the results in this study. As discussed in section EJ for 
irreversible binding the chaperones do not have time to unbind during the translocation, 
rendering a thermodynamic evaluation (as we have done here) of the force inapplicable. 
However, we found that our master equation approach allows us to formally describe 
the case of thermal ratcheting. It will be interesting to see whether it is possible to 
develop a theory that covers both the reversible and irreversible binding regimes (i.e., 
arbitrary values of the binding strength). Possibly techniques and results from the class 
of the parking lot models |IUJ HU E2] could prove useful. 

We have in this investigation not included entropic effects due to polymer flexibility. 
As noted in the main text, provided that the chaperone binding does not depend on the 
curvature of the flexible polymer the binding force as calculated here and the entropic 
force are additive. For not too long flexible polymers the entropic effect could thus be 
included in a standard fashion (see e.g., [3]), but typically these effects in the presence 
of the chaperone-generated drift will be negligible for systems relevant to this study 
jH]. For very long polymers the dynamics changes qualitatively, and has to be modelled 
by a dynamical equation with memory [5|. We have also neglected the volume of the 
translocating polymer, which should be a fair assumption for the relevant biological 
systems. 

We have throughout the study assumed that the binding energy for the chaperones 
is the same along the polymer. However proteins, RNA and DNA in general consist 
of heterogeneous sequences of aminoacids, bases or base-pairs respectively. It would 
therefore be interesting to investigate how heterogeneity in the binding energies along 
the polymer affects the translocation dynamics. 

It has been suggested that in order for a protein to be able to translocate it has 
to be unfolded on the entrance side [221 • The unfolding of a protein in general requires 
the presence of chaperones on the entrance side; as we have seen in this study the 
presence of such proteins always give an opposing force compared to the translocation 
direction. Hence efficient translocation requires that the amount of binding proteins 
on the entrance side is large enough to allow unfolding, but small enough not to cause 
a too large opposing force. It will be interesting to see whether such an optimization 
concerning the concentration of "unfolders" is indeed used in nature. This situation may 
be improved by additional protein channels for the chaperones, by which the relative 
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concentration on both sides of the membrane may be actively regulated. The possibility 
also exists that the translocation could be driven by the refolding of the protein on 
the target side [22] • Alternatively, in cases where the protein is synthesised on the 
entrance side a built-in additional sequence can inhibit folding. This folding-preventing 
sequence then has to be removed on the exit side and then the folding process may 
assist the translocation. We note that effects similar to protein folding can occur for 
RNA and single stranded DNA in the form of secondary structure. In principle protein 
translocation could occur even for an unfolded protein, provided that the chemical 
or electric bias is strong enough. The translocation dynamics could in such case 
provide local information about the protein structure, which could find biotechnological 
applications, similar to RNA translocation [35] . 
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Appendix A. Mean translocation time for a constant force 

Let us consider the mean translocation time, as given by equation (JHJ). For the 
case of a constant force F(m) = F the transfer coefficients, equation (J2J), become 
t+( m ) = (l/r )(l + F/2F ) = t + and t~{m) = (l/r )(l - F/2F ) = t~. Equation 
(JHJ) then becomes: 

1 M (t-\ v v (t + Y 

5(f) • 

If we now assume that t~ < t + and use the result for a geometric series (valid for 
Q 7^ 1) YX=q Q z — (1 ~~ Q v+1 )/i)- — Q) we fi n d t ne following expression for the mean 
translocation time 

M t+ [ ft~\ M+1 ] M MF 

1- 7- = (A.2) 



t+ - t~ (t+ - t-f 1 \t+ ) J t+ -t 

where we in the last step have assumed that M 1. For large M (and a constant force), 
the mean translocation time is thus inversely proportional to the force, as it should. 



Appendix B. Binding partition function 

In this appendix we obtain the binding statistical weight Z 7 (m, n 7 ). 

In order to obtain the force on the polymer we must have explicit expressions for the 
statistical weights Z 7 (m, n 7 ) (see equation 0). This quantity is obtained by calculating 
the statistically averaged number of ways to attach n 7 particles on side 7, divided by a 
reference statistical weight Z™ f . We choose Z™ { as the number of states in the absence 
of the polymer (notice that the choice of Z^ ei is arbitrary, since it vanishes in equation 
©)• Let us first calculate Z* ei . Denote by Va the volume of compartment A and by 
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Vg the volume of compartment B. Similarly, we assume a chaperone on side A (side B) 
to occupy a volume v a (v g). There are hence N A ot = Va/v a number of voxels to put 
the chaperones on side A (see figure EJ, and similarly iV]j ot = Vg/vog number of voxels 
to put the chaperones on side B. If we furthermore assume that there are Na (Ng) 
chaperones on side A (side B) , the number of ways of arranging these particles on the 
two sides are 

Z« = ( Nt A = (iV " t)! (B 1) 

7 ~ \nJ N 7 \(N* ot — N y )\ ' 1 • ; 

which then determine the reference statistical weight. Let us proceed by calculating the 
statistical weights Z 7 (m, n 7 ) in the presence of the polymer, neglecting the reduction 
of compartment volume due to the translocating polymer. As before, on side B the 
polymer is divided into m segments such that m = x/a, where x is the protrusion 
distance on side B and a is the size of a binding site. Similarly on side A there are 
M — m = {L — x)/o~ segments, where M = Ljo is the total number of binding sites (L 
is the length of the polymer). We assume that the chaperones cover an integer A 7 (> 1) 
number of binding sites if bound to the polymer on side 7. The total binding energy for 
an attached chaperone is denoted by e 7 (< 0). The maximum number of particles that 
can attach to the polymer on side B is then n^ ax = [m/Xg], i-e., it is the largest integer 
smaller than or equal to m/Xg. Similarly for side A the maximum number of attached 
chaperones is n™ ax = [(M — w)/Xa\- Denote by Q B md (m, ng) the number of ways of 
arranging ng particles onto the m binding sites on side B (Q B md (m, n 7 ) is explicitly 
given in the main text). To obtain the binding statistical weight for side B we have to 
multiply Q B md (m,ng) by the Boltzmann weight associated with binding, i.e., 

Z^ ind (m, n 7 ) = fi 7 ind (m, n 7 ) exp(-/3e 7 n 7 ). (B.2) 

In order to obtain the full statistical weight for side B, we also have to account for the 
fact that when n 7 number of particles are bound to the polymer there are only A^ 7 — n 7 
numbers of molecules left in the "gas" surrounding the polymer. The number of states 
for the "gas" (compare equation ([B.ljl ) is 

= {« t )!}/{(iV 7 - n,)\(N^ - (iV 7 - n 7 ))!}. (B.3) 

For large A^ 7 (see equation 6.1.47 in reference [33]) we have the identity (N + a)\/(N + 
6)! = r(N + a + 1)/T(N + b + 1) w A^- b (l + (a - 6) (a + b + 1)/(2N) + ..), where T(z) 
is the T-function. Applying this result, we find that for the case when a large number 
of chaperones are present at the two sides: 

Zf s (n 7 ) = $^, (B.4) 



where we have introduced the volume fraction on side 7 as $ 7 = A^ 7 / (A^* — Nr 



Combining equations ()B.2j) and (|B.4J) we find that the statistical weight for side 7 is: 

Z 7 (m,n 7 ) = Zf s (n 7 )^ ind (m,n 7 ) = ^ ind (m,n 7 )^, (B.5) 
where we have defined an effective binding strength 

n-y = $ 7 exp(/?|e 7 |). (B.6) 
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For the case of dilute solutions (iV 7 <C iV* ot ) the effective binding strength can be written 
in the common form: 

k 7 = c 7 ^ q (B.7) 

where c 7 = N^/V y is the concentration of chaperones on side 7 (j=A or B), and 

K^ = v ,exp(f3\e,\) (B.8) 

is the equilibrium binding constant Equation (|B.5)1 together with equation ()B.7jl 
defines the statistical weights for the two sides. 

Appendix C. Filling fraction 

In this appendix we consider the filling fraction of chaperones bound to the translocating 
polymer for the cases univalent and divalent of binding, respectively. 

Since we assume that the motion of the chaperones is fast compared to the rate 
of translocation through the pore, the expected numbers (tia) and (ng) of bound 
chaperones on the two sides for a given m are well-defined quantities, which are 
simply obtained by calculating the expectation value with respect to the equilibrium 
distribution, i.e., 

„ max 

7 d 
(%) = Yl n-yP^im,^) = — — (InZy(m)), (C.l) 

7ly=0 7 

where r y=A or B and the total expected number of bound chaperones is (n) = 
(n A ) + (n B ). 

Let us calculate the expected number of bound particles on the two sides for the 
case of univalent A 7 = 1 and non-cooperative binding lu 7 — 1. Using equations (|T§j) and 
Kill we find (n 7 ) = n™ 3X -f 1 where the filling fractions are 

f y=T Jh- = l-(l + K 7 )- 1 . (C.2) 

1 ~r ^ 7 

This finding is a standard result for univalent, non-cooperative binding to a polymer 
|37| . We notice that < / 7 < 1, and that the chain becomes fully occupied, / 7 — > 1, if 
the binding strength is very large k 1 — > 00. When the binding strength is zero k 7 — >• 
there are no chaperones bound, / 7 — > 0, as it should. For positive binding energies 
e 7 > (repulsion) the filling fraction loses its meaning. Since for univalent binding (and 
no cooperativity) the binding sites are independent, the equilibrium probability P^ c 
that a binding site is occupied equals the filling fraction, i.e. P^? c = / 7 . 

We can also calculate / 7 for the case of divalent binding A 7 = 2, and large protrusion 
distances. Using the results from subsection 13.41 we find that 

/ 7 «l-(l + 4« 7 )- 1 /2 ) (C.3) 

for divalent binding. We have that < f 1 < 1, / — > 1 for k 7 — > 00, and / 7 — > 
for K y — > as it should. We notice that for divalent binding the polymer reaches its 
fully occupied state / 7 = 1 "slower" with k 7 than for the case of univalent binding (see 
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equation (|U.2p ). f This is intuitively clear, as for divalent binding those configurations 
have to be overcome in which vacant spots of the size of one binding site have to 
disappear in order to reach / 7 = 1. 



f Equations l)C.2(l and l)C.3(l also follows straightforwardly from the McGhee-von Hippel binding 
isotherm : 



which expresses the fraction / 7 of occupied binding sites as a function of k 7 and A 7 for binding to an 
infinitely long polymer. From the above expression we notice that for large chaperones A 7 ^> 1 the 
filling fraction takes the simple form / 7 = 1 — (1 + X^Ky) . 
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